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SYNTHESIS OF PROTEINS BY NATIVE CHEMICAL LIGATION 




Specification 

Field of Invention : 

The invention relates to methods and intermediates 
for chemically ligating two oligopeptides end to end with 
an amide bond. More particularly, the invention relates 
5 to methods and intermediates for chemically ligating 

oligopeptides wherein an unoxidized N-terminal cysteine 
of a first oligopeptide condenses with a C-terminal 
thioester of a second oligopeptide to form a p- 
aminothioester intermediate which spontaneously 
10 rearranges intramolecularly to form an amide bond and the 

ligation product. 

Government Rights : 

invention disclosed herein was supported in 
15 part by ftrants Number R01 GM 48897-01, Number P01 GM 

48870-03 iAand Number GM 50969-01 from the National 
Institute^/ dyf Health. The United States government may 
have certain Rights to this invention, 

20 Background : 

Proteins may be synthesized chemically, 
ribosomally in a cell free system, or ribosomally within 
a cell. Advances in each of these areas have 
significantly improved access to many proteins but have 

25 also stimulated demand for yet further improvements. 

Proteins owe their diverse properties to the 
precisely folded three dimensional structures of their 
polypeptide chains. The three dimensional structure of 
a protein determines its functional attributes. However, 

30 at present, it is difficult to predict and/or fully 

explain the biological properties of a protein from its 
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three dimensional structure alone . A better 

understanding of how structure determines the biological 
properties of a protein can be achieved by systematically 
varying the covalent structure of the molecule and 
5 correlating the effects with the folded structure and 

biological function. Accordingly, there is an increased 
demand for enhanced synthetic techniques for synthesizing 
new proteins and protein analogs. 

Techniques derived from recombinant DNA-based 

10 molecular biology can be employed to facilitate the 

expression of proteins in genetically engineered micro- 
organisms. The use of site-directed mutagenesis, as 
disclosed by M. Smith {Angew. Chem. Int. Ed. Engl. 
(1994): vol. 33, p 1214), enables the preparation of 

15 large numbers of modified proteins in useful amounts for 

systematic study, e.g., C. Eigenbrot and A. Kossiakoff, 
Current Opinion in Biotechnology (1992): vol. 3, p 333. 
The use of innovative approaches increases the range of 
amino acids that can be incorporated in expression 

20 systems and promises to significantly extend the utility 

of biosynthetic modification of the covalent structure of 
proteins. (C. J. Noren et al., Science (1989): vol. 244, 
p 182 (1989); J. A. Ellman et al., Science (1992): vol. 
255, p 197.) However, there appear to be limitations 

25 inherent to the nature of ribosomal protein synthesis. 

(V. W. Cornish, et al., Proc. Natl. Acad. Sci. USA 
(1994) : vol. 91, p 2910.) 

Chemical synthesis of proteins has also 
contributed to the exploration of the relationship of 

3 0 protein structure to function. Stepwise solid phase 
synthesis has permitted the de novo preparation of small 
proteins. (T. W. Muir et al., Curr. Opin. Biotech. 
(1993): vol. 4, p 420.) There are also several examples 
of the use of stepwise solid phase synthesis of whole 

35 proteins to explore the molecular basis of biological 
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function. (M. Miller, et al. f Science (1989): vol. 246, 
p 1149; A. Wlodawer, et al., Science (1989): vol. 245, p 
616; L.H. Huang, et al., Biochemistry (1991): vol. 30, p 
7402; and K. Rajarathnam, et al., Science (1994): vol. 
264, p 90.) 

Semi-synthesis through the conf ormationally- 
assisted religation of peptide fragments can also be 
employed, in special instances, to study of the 
structure/ function relationship of proteins. (R. E. 
Of ford, "Chemical Approaches to Protein Engineering", in 
Protein Design and the Development of New therapeutics 
and Vaccines. J, B. Hook, G. Poste, Eds., (Plenum Press, 
New York, 1990) pp. 253-282; C. J. A. Wallace, et al. , J. 
Biol. Chem. (1992): vol. 267, p 3852. An important 
extension of the semisynthesis approach is the use of 
enzymatic ligation of cloned or synthetic peptide 
segments. (L. Abrahmsen, et al., Biochemistry (1991): 
vol. 30, p 4151; T. K. Chang, et al., Proc . Natl. Acad. 
Sci. USA (1994): in press.) Although the above 

methodologies have been successfully applied to the 
synthesis of proteins and protein analogs, T. W. Muir et 
al., report that there is a continued interest in the 
wider application of the tools of organic chemistry to 
the study of proteins (Curr. Opin. Biotech. (1993): vol. 
4, p 420.) 

Stephen Kent et al. recently introduced the 
chemical ligation of unprotected peptide segments as an 
improved route to the total synthesis of proteins. (M. 
Schnlzer, et al., Science (1992): vol., 3256, p 221.) 
Chemical ligation involves the chemoselective reaction of 
unprotected peptides to give a product with an unnatural 
backbone structure at the ligation site. Use of 
unprotected peptides circumvented the difficulties 
inherent to classical chemical synthesis, viz complex 
combinations of protecting groups that lead to limited 
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solubility of many synthetic intermediates, e.g. K. 
Akaji, et al., Chem. Pharm. Bull. (Tokyo) (1985): vol. 
33, p 184. In contrast, the chemical ligation technique 
has allowed us to make good use of the ability to 
5 routinely make, purify, and characterize unprotected 
peptides 50 or more residues in length. Using optimized 
stepwise solid phase methods the preparation in good 
yield and high purity of peptides up to 60 residues is 
routine. In favorable cases, peptides of 80+ residues 

10 can be prepared. (M. Schnolzer, et al., Int. J. Pept. 

Prot. Res. (1992): vol. 40, p 180-193.) 

The key aspect of the above approach to chemical 
ligation is the use of a chemoselective reaction to 
specifically and unambiguously join peptides by formation 

15 of an unnatural (i.e. non-peptide) backbone structure at 

the ligation site. It has permitted the facile 
preparation of a wide range of backbone-modified 
proteins, including analogues of protein domains, e.g., 
ligated 10F3, the integr in-binding module of fibronectin: 

20 95 residues (M. Williams, et al., J. Am. Chem. Soc. 

(1994): in press.) The catalytic contribution of 
flap-substrate hydrogen bonds in HIV-1 protease has been 
elucidated by the chemical synthesis of a homodimer of 99 
residue subunits of this protein by chemical ligation. 

25 (M. Baca, et al., Proc. Natl. Acad. Sci. U.S., (1993): 

vol. 90, p 11638.) Chemical ligation has also proven to 
be useful for the routine, reproducible synthesis of 
large amounts of proteins in high purity with full 
biological activity (20). (R. C. deLisle Milton, et al., 

30 "Synthesis of Proteins by Chemical Ligation of 

Unprotected Peptide Segments: Mirror-Image Enzyme 
Molecules, D- & L-HIV Protease Analogs," in Techniques in 
Protein Chemistry IV , Academic Press, New York, pp. 
257-267 (1992) .) 

35 Chemical ligation can also be employed for the 
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straightforward production of protein-like molecules of 
unusual topology, e.g., four-helix bundle 
template-assembled synthetic protein (MW 6647 Da) (P. E. 
Dawson, et al., J. Am. Chem. Soc. (1993): vol, 115, p 
5 7263); homogeneous multivalent artificial protein (MW 

19,916 Da) (K. Rose, J. Am. Chem. Soc. (1994): vol, 31-16, 
p 3 0) ; artificial neoprotein mimic of the cytoplasmic 
domains of a multichain integrin receptor (MW 14,194 Da) 
(T. W. Muir, et al-, Biochemistry, (1994): vol. 33, pp 

10 7701-7708; and peptide dendrimer (MW 24,205 Da) (C. Rao, 

et al., J. Am. Chem. Soc. (1994): vol. 116, p 6975. The 
range of proteins accessible by this technique is limited 
by the size of the synthetic peptide segments. 

A useful extension would occur if one had direct 

15 synthetic access to native backbone polypeptide chains up 

to the size of typical protein domains. (A. L. Berman, et 
al., Proc. Natl. Acad. Sci. USA (1994): vol. 91, p 4044.) 
Chemical ligation would then be employed to string these 
domains together to explore the world of proteins in a 

20 general fashion. 

A modular strategy for the total synthesis of 
proteins has been developed, based on the convergent 
chemical ligation of unprotected peptides has been 
disclosed by L.E. Canne, et al. (presented at the Annual 

25 Meeting of the Protein Society, San Diego, July 1994) . 

Protein domains (modules) were prepared by chemical 
ligation of 50-70 residue segments; these domains were 
then stitched together to give the target protein. 
Mutually compatible ligation chemistries are required: 

30 intra-domain ligation should optimally yield a stable, 

peptide-like bond; inter-domain ligation will tolerate a 
wider variation of properties of the structure formed at 
the ligation site. 

Straightforward total chemical synthesis of 

35 proteins represents the realization of an important 
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objective of organic chemistry. It raises the exciting 
prospect of unrestricted variation of protein covalent 
structure made possible by general synthetic access, and 
will give new impetus to exploration of the structural 
5 basis of properties such as folding, stability, catalytic 
activity, binding, and biological action. 

What is needed is a technique of native chemical 
ligation which combines the formation of a native peptide 
bond at the ligation site with the advantages of 

10 chemoselective reaction of unprotected peptides. This 

second generation ligation chemistry would significantly 
increase the size of native backbone polypeptides 
directly accessible by total chemical synthesis. It 
could be usefully applied to a wide range of synthetic 

15 targets, including proteins of moderate size, and it 

allows direct access to protein functional domains. 
Native chemical ligation is a foundation stone of a 
general modular approach to the total chemical synthesis 
of proteins. Furthermore, it is compatible with the use 

20 of both chemically synthesized peptides and peptide 
segments derived from other sources. 
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Summary : 

<~*-~^C~~7 oYie aspect of the invention is directed to a 
AjM^ethod o A native chemical ligation. The method of native 
y chemical legation facilitates the chemical synthesis of 
5 ' proteins aW large oligopeptide. The principle of 
'native chemical ligation' is shown in Scheme 1. The 
first step \is the chemoselective reaction of an 
unprotected synthetic peptide-a-thioester with another 
unprotected pepVide segment containing an N-terminal Cys 
10 residue, to giv^\a thioester-linked intermediate as the 

initial covalent product. Without change in the reaction 
conditions, this intermediate undergoes spontaneous, 
rapid intramolecular reaction to form a native peptide 
bond at the ligation site. The target full length 
15 polypeptide product isNobtained in the desired final form 
without further manipulation. The general synthetic 
access provided by the method of native chemical ligation 
greatly expands the scopeVof variation of the covalent 

20 One embodiment of the invention provides a method 

for ligating a first oligopeptide with a second 
oligopeptide end to end for producing an oligopeptide 
product. The first and second oligopeptides are admixed 
in a reaction solution including a catalytic thiol. The 

25 catalytic thiol may be an unconjugated mercaptan or a 
conjugated thiol. Preferred catalytic thiols include 
benzyl mercaptan, thiophenol, l-thio-2-nitrophenol, .2- 
thio-benzoic acid, 2-thio-pyridine, 4-thio-2- 
pyridinecarboxylic acid, and 4-thio-2-nitro-pyridine. 

30 The first oligopeptide includes a Oterminal thioester. 

The second oligopeptide includes an N-terminal cysteine 
having an unoxidized sulfhydryl side chain. The 
unoxidized sulfhydryl side chain of the N-terminal 
cysteine is then condensed with the Oterminal thioester 

35 to produce an intermediate oligopeptide which links the 
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first and second oligopeptides with a £-aminothioester 
bond. The £-aminothioester bond of the intermediate 
oligopeptide then undergoes an intramolecular 
rearrangement to produce the oligopeptide product which 
5 links the first and second oligopeptides with an amide 

bond. 

Another aspect of the invention is directed to an 
oligopeptide intermediate which comprises a first 
oligopeptide segment having a C-terminal thioester, a 

10 second oligopeptide segment having a N-terminal cysteine, 
and a 0-aminothioester linkage unit which links the C- 
terminal thioester and the N-terminal cysteine. The p- 
aminothioester linkage unit spontaneously rearranges 
intramolecularly to form an amide bond linking the first 

15 and second oligopeptides segments end to end. 

^^frngfeh e r ar s pecc or rhe inventxpn^ is d i roc t -uU tojy 
method for producing an oligopeptide having a C-£eif!nTnal 
thioester. The method admixes a resin ha\jifig a linker 
with an unoxidized thiol with a y^oc-amino acid 

20 succinimide ester under reaction cpntfitions to produce a 
Boc-amino thioester-resin. oligopeptide is then 

assembled onto the Boc-am^ffo thioester-resin by stepwise 
solid phase peptide syfithesis. When the oligopeptide is 
complete, the tl>^Boc-amino thioester-resin is cleaved 

25 with HS to pifoduce an oligopeptide having a C-terminal 

thiol. ^^fhe C-terminal thiol is then converted to an 

(a-COSR moiety) of 



"A^^eme 1 



T'Hb oligopeptide thioester 

1 c^i be readily generated from a corresponding 
30 ;^oligopeptide\hiol (-aCOSH) prepared by highly optimized 
r stepwise SPPS oi\ a thioester resin. The thioester resin 
was prepared by\the method of L.E. Canne et al< 
Tetrahedron Letter£\ (1995) : vol. 36, pp. 1217-1220, 
incorporated herein b^ reference. The method of Canne 
35 employs the thioester ^esin disclosed by Blake and 
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Yamashiro (J. Bla^e, Jnt. J. Pept. Protein Res. (1981): 
vol. 17, p 273; DV Yamashiro, et al., Int. J. Pept. 
Protein Res. (1988) V vol. 31, p 322). Peptide products 
were cleaved, purifies, and characterized by conventional 
5 methods. (M. SchnolzeV, et al., Int. J. Pept. Protein 
^J&ejS-*~rT&92T^ 180-193.) 

The Yamashiro methodology activates a thiocarboxyl 
group on a protected oligopeptide with diaryl disulfides 
to give acyl disulfides (Yamashiro et. al. Int. J. 
10 Peptide Protein Res. (1922): vol. 31, pp 322-334). These 

C-terminal-peptide-acyl-disulf ides are highly reactive 
electrophilic intermediates which are attacked and 
subsequently coupled with an a-amino group on the N- 
terminus of a second peptide to form native peptide 
15 bonds. The reported coupling yields using 2, 2'-dipyridyl 

disulfide as the activator of the thiocarboxyl group 
afford the desired a-IB-92 product in 45% yield. Overall 
yields based on the starting resin for a 3 -segment 
synthesis of a-IB-92 are reported as 8%, while a 2- 
20 segment synthesis gave 11%. 

Due to the high reactivity of the diaryl disulfide 
bond, the Yamashiro approach requires extensive 
protection and deprotection of amino acid residues 
present in the peptide molecule. The lysine group for 
25 example is protected as a citraconyl derivative because 

of the reactive amine functionality. Additionally, an 
Msc group or tBOC group is used to protect any terminal 
amine functionalities present in the molecule. 

She invention stated herein does not require the 
use of any protecting groups for the coupling of two 
oligopeptides because a less reactive (and thus more 
chemoselecoive) thioester electrophile is used instead of 
the acyl disulfide moiety (Yamashiro' s approach) . In the 
intermoleculaV coupling step, this thioester electrophile 
35 requires a mone nucleophilic sulfhydryl moiety rather 
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than a\ f ree amine. The nucleophilic sulfhydryl moiety 
£■/ can be )found on cysteine residues. Since the amino and 
hydroxy A functionalities are relatively unreactive to the 
thioesteA electrophile, a selective coupling of the two 
5 unprotected oligopeptides is achieved with the cysteine 
sulfhydryl W>iety. The sulfhydryl group on the cysteine 
of peptide \ will first attack the thioester of peptide 
1 and form \a coupled thioester intermediate. This 
coupled thioester intermediate is concomitantly attacked 
10 by the free V-amino moiety from the cysteine and 

spontaneously rearranges to form the native peptide bond. 
Yields are therefore increased by eliminating protection 
and deprotection a±eps, since side undesired reactions 
a^e_re4ueed^C§chg3Dae\i ) . 
15 The thioester moiety is prepared from a precursor 

thioacid which is obtained by optimized stepwise solid- 
phase peptide synthesis on a aminomethyl resin support 
equipped with a thioester resin linker. The precursor 
thioacid is subsequently generated in liquid HF at 0 °C 
20 in 1 hour (Yamashiro et. al. Jnt. J. Pept. Protein Res. 

(1988) : vol. 31, p 322) . 

A procedure for the synthesis of the thioester 
linker with use of a stepwise solid phase peptide 
synthesis has been reported by Blake (Blake et. al. Proc. 
25 Natl. Acad. Sci. USA (1981): vol. 78, 4055) and Yamashiro 

(Yamashiro et. al. Int. J. Pept. Protein Res. (1988): 
vol. 31, p 322) . This method is undesirable, however, 
because it requires the conversion of Boc-amino acid 
succinimide esters to the corresponding Boc-amino 
30 thioacids with hydrogen sulfide. An improved methodology 

reported herein, utilizes the Boc-amino acid succinimide 
ester directly and therefore avoids the inconvenience and 
hazards of hydrogen sulfide gas (Kent et. al. Tetrahedron 
Lett. (1995): vol. 36, p 1217). 
35 — In this\iethod (Scheme 2), thiol 3 is generated 
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f rom >the reaction of chloride 2 (Yamashiro et. al. Int. 
J . Pefct. Protein Res. (1988): vol. 31, p 322) with 
thiourea, followed by hydrolysis of the resulting 
thiouronium salt in aqueous base. Thiol 3 is a general 
intermediate which can be reacted with a wide range of 
commercially available Boc-amino acid succinimide esters 
to produce the desired thioester linker 1 which is 
conveniently isolated as the dicyclohexylamine (DCHA) 
salt. 

ModelXstudies were undertaken with small peptides 
to investigate the native chemical ligation approach. To 
help explore the mechanism of the reaction, the peptide 
Leu-Tyr-Arg^Ala-Gly-or-COSBzl was reacted with Ac-Cys. 
The exact mafes of the resulting ligation product was 
determined by \electrospray mass spectrometry, and was 
consistent witfk a thioester-linked peptide as the 
ligation product generated by nucleophilic attack of the 
Ac-Cys side chain\ on the a-thioester moiety of the 
peptide. Reaction oj£ Leu-Tyr-Arg-Ala-Gly-a-COSBzl with 
H-Cys-Arg-Ala-Glu-Tyr^Ser (containing an unblocked a-NH2 
functional group) proceeded rapidly at pH 6.8 (below pH 
6 the reaction proceeded very slowly, suggesting the 
involvement of the ionizedNthiolate form of the Cys side 
chain), and gave a single pi^oduct of the expected mass. 
This product lacked susceptibility to nucleophiles, and 
had the ability to form disulf ide-linked dimeric 
peptides, indicating unambiguously the formation of a 
native amide bond at the ligationVsite. These studies 
were consistent with the mechanism sijown in Scheme 1, in 
which the initial thioester ligatiok product was not 
observed as a discrete intermediate because of the rapid 
rearrangement to form a stable peptides, bond. Facile 
intramolecular reaction results from ^the favorable 
geometric arrangement of the a-NH 2 moiety wrth respect to 
the thioester formed in the initial chemosel\ctive 




Scheme 2 
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ligation Reaction. Use of such 'entropy activation' for 
peptide bonck formation is based on principles enunciated 

f by Brenner. \m. Brenner, in Peptides. Proceedings of the 

Eighth European Peptide Symposium H. C. Beyerman, Eds. 
5 (North Holland \ Amsterdam, 1967) pp. 1-7.) The concept 

of 'entropy act Wat ion' for peptide bond formation has 
been more recently adopted by D. S. Kemp et al. (J. Org. 
Chem. (1993): volASS, p 2216) and by C.-F. Liu, et al. 
~ (*J. AM. Cijeju. Sou. \(1 9 94) I "voirr 116, p 4 14 9) . 

10 Several model peptides have been synthesized by 

the method of native chemical ligation. The successful 
synthesis of these model peptides establish that native 
chemical ligation is generally applicable to peptides 
containing the full range of functional groups normally 

15 found in proteins. Even free internal Cys residues may 

be present in either of the reacting segments. Internal 
Cys residues can undergo ester exchange with the 
peptide-a-thioester component however , this reaction is 
unproductive because no rearrangement to the amide bond 

20 can occur; the thioester formed is readily reversible and 
remains a productive part of the reacting system. As 
disclosed herein, native chemical ligation is limited to 
reaction at an N-terminal Cys residue. It is important 
to prevent the side chain thiol of this Cys from 

25 oxidizing to form a disulfide linked dimer, because this 

is unreactive in the ligation. An excess of thiol 
corresponding to the thioester leaving group was used to 
keep the Cys residues in reduced form without interfering 
with the ligation reaction. The amino-terminal peptide 

30 segment must be prepared by chemical synthesis to equip 

it with the necessary a-COSR functionality. Furthermore, 
for optimal ligation, this component should have an 
unhindered (i.e. non ^-branched) C-terminal amino acid. 
Solubilizing agents such as urea or guanidine 

35 hydrochloride did not interfere with the ligation and 
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could be used to enhance the concentration of peptide 
segments, and thus increase the reaction rate. 

irther model reactions demonstrate that the use 
'of betters, thioester leaving groups results in faster 
ligation reactions. We applied this observation to the 
native chemical ligation of peptides from the 
extracellular domain of a human cytokine receptor (R. 
D'Andrea, et aK, Blood, (1994): vol. 83, p 2802.) as 
shown in Scheme 5 A Use of the 5-thio-2-nitrobenzoic acid 
(-SNB) leaving groub, corresponding to the reduced form 
of Elman's reagent, gave rapid high yield reaction. As 
described below in connection with Scheme 5, the reaction 
between the peptide segments was observed to have gone 
essentially to completiork in less than 5 minutes, giving 
the 50 residue product with a native peptide bond at the 
site of ligation. Thus, rapid native chemical ligation 
can be achieved by use of a ^hioester leaving group with 
■s u i tably tuned pr o perties ^— 

Application of the nativ^ chemical ligation method 
o the total synthesis of a protein molecule was 
illustrated \y the preparation of human interleukin 8 
(IL-8) . (M. BWiolini, et al., FEBS Lett. (1989): vol. 
307, p 97; I. Cl&rk-Lewis, et al., J. Biol. Chem. (1994): 
vol. 269, p 16075\ (1994) ; I. Clark-Lewis, Biochemistry 
(1991): vol. 30, p\3i28; and K. Rajarathnam, et al., 
(1994): Biochemistry ,\( 1994 ) : vol. 29, p 1689.) The 72 
amino acid polypeptide chain contains four Cys residues, 
which form two functionally critical disulfide bridges in 
the native protein moleculeV The total synthesis of IL-8 
is shown in Scheme 7. TheVtwo unprotected synthetic 
peptide segments reacted cleanly to give the full length 
polypeptide chain in reduced\ form without further 
chemical manipulation (9) . This successful ligation was 
particularly significant because the 33- and 39-residue 
IL-8 segments each contained two CysXresidues, and 
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together encompassed 18 of the 20 genetically encoded 
amiho acids found in proteins. The purified product was 
folded and oxidized as previously described, to give IL-8 
with k mass precisely 4 daltons less than that of the 
originaY ligation product, indicating the formation of 
two disulfide bonds. The properties of this folded 
product were identical to those of previously studied 
authentic >IL-8 samples. Titration in an assay for 
neutrophil \elastase release demonstrated that the 
10 potencies (ED50 = 0.3nM) and maximal responses of the 

folded, ligated [Ala33]IL-8 and the corresponding 
molecule obtained by conventional synthesis were 
indistinguishable and identical to native sequence IL-8. 
This result unambiguously confirmed the formation of a 
15 peptide bond at tire ligation site, because the thioester- 

to-amide rearrangement must have taken place to give the 
free Cys 34 side chai^i that formed the native disulfide 

Proteins are usually studied by expression in 

20 genetically engineered micro-organisms using the methods 

of recombinant DNA-based molecular biology. Methods such 
as site-directed mutagenesis have had a major impact on 
the ability to prepare large numbers of modified proteins 
in useful amounts for systematic study. Innovative 

25 approaches have increased the range of amino acids that 

can be incorporated in expression systems and promise to 
significantly extend the utility of biosynthetic 
modification of the covalent structure of proteins. 
However, there appear to be limitations inherent to the 

3 0 nature of ribosomal protein synthesis. 

Wieland discloses a method for synthesizing 
dipeptides using a thioester intermediate. (Wieland et. 
al. Liebigs Ann. Chem. (1953): vol. 580, p 159.) Wieland 
utilizes the reaction of S-glycyl-(or other un-branched 

3 5 aminoacyl-) thiophenols with cysteine. Thus, the 
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sulfhydryl group on the cysteine residue first attacks 
the thioester of the S-glycyl-thiophenol and forms a 
coupled thioester intermediate. This coupled thioester 
intermediate is concomitantly attacked by the free a- 
amino moiety from the cysteine and spontaneously 
rearranges to form the native peptide bond. 

A limitation of the Wieland approach is the size 
of the molecules utilized (only mono-amino acids are 
coupled to cysteine) and stems from the methodology used 
for the synthesis of the thioester. To form the 
thioester, Wieland 's approach requires the activation of 
the terminal carboxylic acid as a mixed anhydride, acid 
chloride or thioacid. A problem arises if an acidic 
moiety is present in an amino acid residue such as Asp(D) 
and Glu(E). In these cases, the Wieland produces an 
undesired side reaction and therefore requires a complex 
protecting group strategy, particularly if oligopeptides 
are synthesized. 

The invention described herein eliminates the need 
for an elaborate protecting group strategy since the 
oligopeptide-thioester moiety is derived from a precursor 
thioacid. This precursor thioacid (peptide-ce-COSH) is 
synthesized by a standard stepwise solid-phase peptide 
synthesis on an aminomethyl resin support, equipped with 
a thioester resin linker. The precursor thioacid is 
cleaved from the linker/resin almost quantitatively (99%) 
in liquid HF at 0 °C for 1 hour. 

The thioester peptide (peptide-a-COSR) can be 
synthesized in two general ways: 

(1) Reaction of a crude lyophilized thioacid peptide 
(peptide-ct-COSH) with Ellman's reagent (5, 5'-dithiobis-2- 
nitrobenzoic acid, available from Aldrich company) at pH 
5.5 (2.0 equivalents), 6M Guanidine in lOOmM Na acetate 
buffer. This gives the SNB-thioester peptide (peptide-a- 
COSNB) which is subsequently purified by reversed phase 
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high performance liquid chromatography (RPHPLC) . 

(2) Reaction of a crude lyophilized thioacid peptide 
(peptide-a-COSH) with benzyl bromide at pH 4.0, 6M 
guanidine and lOOmM Na acetate buffer. The benzyl 
5 thioester (peptide-a-COSBn) is then purified by RPHPLC. 

Ttite conditions stated above, permit the formation of 
£/an unprotected oligonucleotide which is equipped with the 
y activated thioester. Subsequent reaction with a second 
peptide containing a terminal cysteine residue, permits 
10 a facile couplingN^i th tne formation of a native peptide 
bond and can generate\oUgopeptide chains of 100 or more 

• amine 1 ft "! * ^rJdnes (Sc henite*.!) . 

In favorable cases, chemical synthesis has already 
made important contributions to the exploration of the 
15 relationship of protein structure to function. Stepwise 
solid phase synthesis has permitted the de novo 
preparation of small proteins (14) and there have been 
several notable examples of the use of this method of 
total protein synthesis to explore the molecular basis of 
20 biological function. Another method that has in special 

instances allowed chemistry to be applied to the study of 
proteins is semi-synthesis through the conf ormationally- 
assisted religation of peptide fragments. An important 
extension of the semisynthesis approach is the use of 
25 enzymatic ligation of cloned or synthetic peptide 
segments. Although these methods currently have severe 
limitations, there continues to be serious interest in 
the wider application of the tools of organic chemistry 
to the study of proteins. 
30 Native chemical ligation provides precisely that 

capability. It combines the formation of a native 
peptide bond at the ligation site with the advantages of 
chemoselective reaction of unprotected peptides. This 
second generation ligation chemistry dramatically 
35 increases the size of native backbone polypeptides 
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directly accessible by total chemical synthesis. It can 
be usefully applied to a wide range of synthetic targets, 
including proteins of moderate size, and it allows direct 
access to protein functional domains. Native chemical 
5 ligation is a foundation stone of a general modular 
approach to the total chemical synthesis of proteins. 
Furthermore, it is compatible with the use of both 
chemically synthesized peptides and peptide segments 
derived from other sources. 

10 Straightforward total chemical synthesis of proteins 

represents the realization of an important objective of 
organic chemistry. It provides for unrestricted 
variation of protein covalent structure made possible by 
general synthetic access, and provides new impetus to 

15 exploration of the structural basis of properties such as 

folding, stability, catalytic activity, binding, and 
biological action. 

In an alternative embodiment, the carboxy-terminal 
peptide segment or protein module can be expressed by 

20 standard recDNA means; provided the product contained an 

N-terminal Cys residue, it could be reacted with the 
synthetic amino-terminal peptide-cr-COSR using the native 
chemical ligation described here to give a product in 
which part of the protein had derived from chemical 

25 synthesis and part from ribosomal synthesis. 




PCT/US95/05668 

22 - 

Detailed Description : 

Peptide-a-thioacid formation 

5 A typical procedure for the formation and 

utilization of the thioester resin linker for use in the 
solid phase synthesis of peptide-a-thioacids is as 
follows: (Kent et. al. Tetrahedron Lett. (1995) : vol. 36, 
p 1217) . 

10 

4- (a-Mercaptobenzyl) phenoxyacetic acid, dicyclo- 
hexylamine (3) Scheme 2. A mixture of 2, formed using 
the conditions as established by Yamashiro et. al. Int. 
J. Pept. Protein Res. (1988): vol. 31, pp 322-334, (7.5 

15 grams, 27 mmol) , thiourea (2.3 g, 30 mmol) , and ethanol 
(100 mL) were heated to reflux (conditions as reported by 
Koenig et al J. Org. Chem. (1958): vol. 23, pp 1525- 
1530) . After 4 hours, conversion to the thiouronium salt 
was essentially complete as shown by TLC (90:5:5 

20 chloroform: Methanol: Acetic acid) . 10N NaOH (30 ml) was 
added and the reflux continued for 2-3 hours. After 
cooling to room temperature, the reaction mixture was 
concentrated in vacuo to approximately half the original 
volume, acidified with concentrated HC1 (to pH 2.0), and 

25 extracted with ethylacetate (4 x 3 0 mL) . The combined 

ethylacetate extracts were washed with saturated NaCl (1 
x 3 0 mL) and dried over MgS0 4 . The volatile materials 
were removed in vacuo. The resulting oil was dissolved 
in ethylacetate (100 mL) and any insoluble material 

3 0 filtered. DCHA (dicyclohexylamine - available from 

Aldrich company), (6.0 mL, 30 mmol) was added to the 
filtrate with stirring. Within a few minutes, a white 
solid began to precipitate. Diethylether (150 mL) was 
added and the suspension cooled at -2 0 °C for several 

35 hours. The resulting white solid was filtered, washed 
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with Diethylether, and dried under vacuum to give 3 (10-3 
g, 23 mmol, 84%): l H NMR (CDC1 3 ) : S 7.30 (m, 7H) , 6.82 
(d, 2H, J=8.7 Hz), 5.39 (br s, 1H) , 4.40 (s, 2H) , 2.81 
(m, 2H) , 2.23 (br s, 1H, ex D 2 0), 1.88-1.02 (comp m, 20H) ; 
5 FAB MS (cesium ion): calc for [C^I^NOjS, H + ] 456.2572, 

found 456.2572. Anal. Calcd for C27H37NO3S: C, 71.17; H, 
8.18; N # 3.07; S, 7.04. Found: C, 71.11; H, 8.41; N, 
3.08; S, 7.09. 

10 General Synthesis of Boc-amino thioester linker (1), 

dicyclohexylamine salt (scheme 2). A mixture of 3 (3.67 
mmol), Boc-Ala-OSu (available from Novabiochem Corp.) 
(3.68 mmol), DIEA (diisopropylethylamine-5 . 74 mmol), 
dimethylformamide (35 mL) and methylene chloride (4 mL) 

15 was stirred at room temperature. After several hours, 
the initial white suspension completely dissolved to give 
a clear, colorless solution. After 24 hours, the 
reaction mixture was poured into IN HC1 (150 mL) and 
extracted with ethylacetate (4 x 35 mL) . The combined 

20 ethylacetate extracts were washed with IN HC1 (2 x 30 

mL) , H 2 0 (1 x 30 mL) , saturated NaCl (1 x 30 mL) and dried 
over MgS0 4 . Volatiles were removed in vacuo. The 
resulting oil was purified by flash chromatography 
(925:50:25 Chloroform :MeOH: acetic acid) to give an oil 

25 contaminated with Acetic acid. To remove residual Acetic 

acid, the oil was dissolved in Chloroform (40 mL) and 
washed with 0.1 N HC1 (7 x 10 mL) , saturated NaCl (1 x 10 
mL) and dried over MgS0 4 . Volatiles were removed in vacuo 
to give 1 as an oil. This oil was dissolved in 

30 diethylether (10 mL) to which was added dicyclohexylamine 
(1 equivalents). Hexane (100 mL) was added with stirring 
to separate the dicyclohexylamine salt of 1 as a thick 
oil from any unreacted dicyclohexylamine. Solvents were 
decanted from the oil and the oil dissolved in CH 2 C1 2 (30- 

35 40 mL) . The resulting solution was concentrated in vacuo 
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to give the dicyclohexylamine salt i as a white foamy 
solid (scheme 2) . 

An example of the linkage and synthesis on the resin 
is as follows: 4-[a-(Boc-Ala-S) benzyl ]phenoxyacetic acid 
5 (0.80 mmol) is added in 9 ml methylene chloride to 1.00 

g aminomethyl-resin (0.40 mmol) and treated at 0 °C with 
1.3 3 mL 0.6 M DCCI (dicyclohexylcarbodiimide) in 
methylene chloride for 15 min and at 24 °C for 3 0 
minutes. The product is then subjected to standard 
10 solid-phase peptide synthesis conditions (Kent et. al. 

^ Tetrahedron Letters (1995): vol. 36, p 1217; J. Blake, 

S Int. J. Pept. Protein Res. (1981): vol. 17, p 273). 

Once the desired chain is synthesized, the peptide resin 
g (approx. 45 nmol original load) is treated in 8 mL liquid 

CP 15 HF (0.8 mL anisole) at 0 °C for 1 hour. After 

n evaporation with nitrogen, the residue is washed with 

ethyl acetate. The solid is subsequently stirred in 
water (approx. 15 mL) at 0 °C while adjusting the pH to 
!l 6.0 with solid ammonium bicarbonate. Filtration and 

fQ 20 lyophilization gives the crude thioacid product which can 

0 be further purified by preparative HPLC in 3 0 mg batches. 
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Preparation of the thioester terminal peptide segment 

25 The a-COSR thioester peptide can be synthesized in two 

general ways: 

(1) Reaction of a crude lyophilized thioacid peptide 
with Ellman's reagent (5, 5'-dithiobis-2-nitrobenzoic 
acid, available from Aldrich company) at pH 5.5 (2.0 

3 0 equivalents) , 6M Guanidine in lOOmM Na acetate buffer. 

This gives the SNB-thioester peptide which is 
subsequently purified by reversed phase high performance 
liquid chromatography (RPHPLC) . 

(2) Reaction of a crude lyophilized thioacid peptide 
35 with benzyl bromide at pH 4.0, 6M guanidine and lOOmM Na 
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Scheme 3 
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acetate buffer- The benzyl thioester is then purified 
by RPHPLC. 

The conditions stated above, permit the formation of 
an unprotected oligonucleotide which is equipped with the 
5 activated thioester. Subsequent reaction with a second 
peptide containing a terminal cysteine residue, permits 
a facile coupling with the formation of a native peptide 
bond and can generate oligopeptide chains of 100 or more 
amino acid residues (Scheme 1) . 



10 
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Examples 

Example 1 

The model peptide Leu-Tyr-Arg-Ala-Gly-aCOSH 
5 (Sequence No.: 1) is prepared by optimized stepwise 

solid-phase peptide synthesis on an aminomethyl resin. 
The thioester resin linker is prepared by a generalized 
version as adopted from Kent et. al. Tetrahedron Letters, 
(1995): vol. 36, p 1217; J. Blake, Int. J. Pept. Protein 

10 Res. (1981): vol. 17, p 273; D. Yamashiro and C.H. Li, 
ibid. (1988): vol. 31, p 322. Once the desired chain is 
synthesized, the peptide resin (approx. 45 /mol original 
load) is treated in 8 mL liquid HF (0.8 mL anisole) at 0 
°C for 1 hour. After evaporation with nitrogen, the 

15 residue is washed with ethyl acetate. The solid is 
subsequently stirred in water (approx. 15 mL) at 0 °C 
while adjusting the pH to 6.0 with solid ammonium 
bicarbonate. Filtration and lyophilization gives the 
crude thioacid product which can be further purified by 

20 preparative HPLC in 3 0 rog batches. 

The thioester terminal peptide segment is 
subsequently prepared from the thioacid fragment by 
chemical synthesis to equip it with the necessary a-COSR 
functionality where R is an alkyl group such as benzyl, 

25 5-thio-2-nitrobenzoic acid (-SNB) , thiophenol, etc. The 

use of better thioester leaving groups resulted in faster 
ligation reactions. Thus, the model peptide Leu-Tyr-Arg- 
Ala-Gly-aCOSH (Sequence No.: 1) is first converted to the 
thiobenzylester by reaction with benzyl bromide (15 

30 equivalents) in 6.0 M guanidine-HCl , pH 4.6, sodium 

acetate buffer to form Leu-Tyr-Arg-Ala-Gly-ctCOSBn 
(Sequence No.: 3). The resulting peptide is purified 
under standard reversed-phase high-performance liquid 
chromatography (HPLC) conditions using approximately 20- 

35 45% acetonitrile at 1% per minute; monitored at 214 nm. 
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For the peptide H-Cys-Arg-Ala-Glu-Tyr-Ser (Sequence 
No.: 2), solid phase methods allow the preparation of 
peptides of up to 60 residues in good yield and high 
purity as described in M. Schnolzer, P. Alewood, D. 
5 Alewood, S.B. H. Kent, Int. J. Pept. Protein Res. 

(1992) : vol. 40, 180. 

First, to explore the mechanism of the reaction, the 
peptide Leu-Tyr-Arg-Ala-Gly-aCOSBn (Bn, benzyl; sequence 
No.: 3) was reacted with Ac-Cys (containing a blocked a- 

10 NH 2 functional group - commercially available from 

Novabiochem corp.). The exact mass of the resulting 
ligation product, Leu-Tyr-Arg-Ala-Gly-aCOS-CH 2 C (NHAc) C0 2 H 
(Sequence No.: 4), was determined by electrospray mass 
spectrometry and was consistent with a thioester- linked 

15 peptide as the ligation product generated by nucleophilic 
attack of the Ac-Cys side chain on the a-thioester moiety 
of the peptide. 

Finally, the reaction of Leu-Tyr-Arg-Ala-Gly-aCOSBn 
(Sequence No. : 3) with Cys-Arg-Ala-Glu-Tyr-Ser (Sequence 

20 No.: 2, containing an unblocked a-NH 2 functional group) 
proceeded rapidly at pH 6.8 (below pH 6.0, the reaction 
proceeded very slowly, suggesting the involvement of the 
ionized thiolate of the Cys side chain at pH 6.8; scheme 
3) and gave a single product of the expected mass. The 

25 peptides Leu-Tyr-Arg-Ala-Gly-aCOSBn + Cys-Arg-Ala-Glu- 

Tyr-Ser were reacted in 0.1 M phosphate buffer at pH 6.8, 
6.0 and 4.7 at 25 °C. After 1 hour, the reactions had 
proceeded as follows: at pH 6.8 >95%; at pH 6.0 
approximately 10%; and at pH 4.7, approximately 1.0% of 

3 0 ligated product Leu-Tyr-Arg-Ala-Gly-Cys-Arg-Ala-Glu-Tyr- 

Ser (Sequence No.: 5). As observed by HPLC, scheme 3 
shows the pH dependence of the reaction after 1 hour and 
at 25 °C. This product lacked susceptibility to 
nucleophiles and had the ability to form disulf ide-linked 

35 dimeric peptides, indicating unambiguously the formation 
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of a native amide bond at the ligation site. 
^^jq^ Another, unpublished model using the 2- 
rfcnioacetic acid derivative Leu-Tyr-Arg-Ala-Gly-SCH 2 COOH 
(Sequence No.Xs - , formed from attack of the thioacid 
5 Leu-Tyr-Arg-Ala-^Gly-SH (Sequence No. : 1) , onto 2- 
bromoacetic acid Yn methylene chloride) + Cys-Arg-A-la- 
Glu-Tyr-Ser (Sequence No.: 2) was ligated at pH 6.8 in 
0.2 M phosphate buffer, at 45 °C. After 1.0 hour the 
reaction had proceededxto 80% as observed in scheme 4 by 
10 HPLC. The isolation tff oxidation products from the 

ligated Leu-Tyr-Arg-Al^-Gly-Cys-Arg-Ala-Glu-Tyr-Ser 
(Sequence No.: 5) and unreaoted Cys-Arg-Ala-Glu-Tyr-Ser 
demonstrated the presence oS^ a free thiol ligation 
^p ro duct -: 

15 The native chemical ligation procedure is generally 

applicable to peptides containing the full range of 
functional groups, normally found in proteins. Even free 
internal Cys residues may be present in either of the 
reacting segments. Internal Cys residues can undergo 
20 ester exchange with the peptide-a-thioester component; 

however, this reaction is unproductive because no 
rearrangement to the amide bond can occur, the thioester 
formed is readily reversible and remains a productive 
part of the reacting system. 
25 The native chemical ligation procedure is limited to 

reaction at an amino-terminal Cys residue. To prevent 
the side chain thiol of this Cys from oxidizing to form 
a disulf ide-linked dimer, an excess of thiol 
corresponding to the thioester leaving group is used to 
keep the Cys residues in reduced form without interfering 
with the ligation reaction. In addition, small amounts 
of low molecular weight thiols such as benzyl mercaptan 
or thiophenol are added to the coupling reaction mixture 
to maintain a reducing environment. 



30 
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The addition of thiols increases the reactivity of 
the thioester, particularly if the added thiol is a 
better leaving group than the pre-formed thioester. An 
5 example of this observation is when the benzyl ester is 
converted to a phenyl ester by addition of thiophenol to 
reaction. Reaction yields and rates are substantially 
increased. For example, after 7 hours with benzyl 
mercaptan, the Barnase reaction yielded 25%, while the 
10 thiophenol treatment to the same reaction mixture yielded 

90%) . 

Addition of thiols to the ligation mixture also keeps 
the reaction mixture in a reduced form. This prevents 
oxidation of the reactive N-terminal Cys residues and 

15 when internal Cys residues are present, the thiols reduce 
the formation of intramolecular disulfide bonds. 
Additionally, the reducing environment increases the 
stability of the thioester segment (ligation reactions 
can proceed overnight with little or no hydrolysis at pH 

20 7.5). 

Example 2 

A rapid native chemical ligation is illustrated by 
the synthesis of a peptide segment corresponding to 

25 residues 46 to 95 from the external domain of the human 
IL-3 receptor 0-subunit incorporated herein: R.D' Andrea 
et. al., Blood (1994): vol. 83, p 2802. 

Crude synthetic IL-3 Msc (46-76) aCOSH was converted 
to the 5 thio-2-nitrobenzoic acid ester (-COSNB) by 

30 treatment with 5, 5'-dithio-bis (2-nitrobenzoic acid) [10 
equivalents (eq) ] in 8 M urea, pH 4.0, 50 mM ammonium 
acetate buffer [Msc, 2 (methyl-sulf onyl) -ethyloxy-carbonyl 
(Fluka #69227) protecting group is placed on the N- 
terminus using 1.1 equivalents (eq.) 2- 

35 (methylsulf onyl) ethyl 4-nitrophenyl carbonate, l.l eq. 
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diisopropylethylamine, and . 5 M dimethylf ormamide] . This 
thioester-containing material was found to be completely 
stable below pH 6.0, and was readily purified under 
standard reversed-phase HPLC conditions using 
5 approximately 20-45% acetonitrile at 1% per minute and 
monitored at 214 nm. 

As shown in scheme 5, ligation is initiated by adding 
IL-3 [Cys77] (77-95) (prepared by standard solid phase 
methods Kent et al. , Int. J. Pept. Protein Res. (1992): 

10 vol. 40, 180) to purified IL-3 Msc (46-76) aCOSNB at the 

stated pH and the reaction is monitored by UV [the 
substituted aryl thiolate leaving group has a 
characteristic UV absorbtion at 412 nm (e^, 412nm = 13, 
700 dir^mol^cm" 1 ) ] . At pH 7.0, the reaction is essentially 

15 complete within 5 min. No reaction is observed when 

Msc (46-76) aCOSNB is exposed to a 10-fold molar excess of 
Leu-enkephalin (amino-terminal residue, Tyr) at pH 5.0. 
This control experiment confirms the absolute requirement 
for an amino-terminal Cys residue at the site of ligation 

20 (scheme 5) . 

Purified IL-3 [Cys77] (77-95) (0.98 mM) and IL-3 (46- 
76)aCOSNB (0.9 mM) were reacted in 8 M urea, pH 5.0, 50 
mM ammonium acetate buffer at 23 °C (monitored by 
analytical HPLC 9C18 reversed phase 22.5 to 45% 

25 acetonitrile at 0.7% per minute; 214 nm) . After 1 hour, 

the ligation solution is exposed to the reducing agent 
tris(2-carboxyethyl)phosphine (TCEP) at pH 9.0 and 
subsequently raised to pH 13.0 to remove the Msc 
protecting group (Treatment with TCEP was found to aid in 
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the purification and analysis of product by reducing the 
thiophenol and benzyl mercaptan disulfide products which 
had a tendency to co-elute with the peptide products) . 
Scheme 6 shows the progress of the reaction by HPLC; 
5 conversion of the starting peptides to the crude product 
is shown (scheme 6) . The 50 residue product has the 
expected molecular mass by electrospray mass spectrometry 
[observed, 5747.0 daltons; calculated (average isotope 
composition), 5747.4 daltons]. The ligation product is 

10 shown to be stable at high pH, reducing conditions, and 

forms an intramolecular disulfide bond. These 
observations are consistent with the presence of a native 
peptide bond at the site of ligation. 

Analogous methods have required removal of protecting 

15 groups (J. Blake et. al., Int. J. Pept. Protein Res. 

(1981): vol. 17, p 273; Kemp et. al. J. Org. Chem. 

(1993) : vol. 58, p 2216; Liu et. al. J. Am. Chem. Soc. 

(1994) : vol. 116, 4149) or conversion of intermediates to 
the final form, or both steps. No previous method has 

20 allowed the chemical reaction of unprotected peptide 
segments to directly yield a native backbone final 
product. 

Example 3 

25 The IL-8 (34-72) segment (Sequence No. 9) is prepared 

by optimized stepwise solid-phase methods as described by 
Kent et al., Int. J. Pept. Protein Res. (1992): vol. 40, 
180 and yield peptides from 60-80 residues in good yields 
and high purities. The peptide-aCOSH is prepared by 

30 optimized stepwise solid-phase peptide synthesis on a 

aminomethyl resin with a thioester linker. The thioester 
linker is prepared by a generalized version as adopted 
from J. Blake, Int. J. Pept. Protein Res. (1981): vol. 
17, p 273; D. Yamashiro and C.H. Li, ibid, vol. 31, 322 

35 (1988) . Products are subsequently purified by standard 
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reversed-phase HPLC conditions and characterized by 
standard methods which include electrospray mass 
spectrometry • 

Crude synthetic segment IL-8 ( 1-3 3 ) aCOSH ( Sequence 
5 No.: 7) is converted to the thiobenzyl ester by reaction 

with benzyl bromide (15 equivalents) in 6 M guanidine-HCl 
at pH 4.6 in 100 mM sodium acetate buffer. The reaction 
mixture is purified under standard reversed-phase HPLC 
conditions and forms the thiobenzyl ester, IL-8(l- 

10 33)aCOSBn (Sequence No.: 8), (Scheme 7). 

The segments IL-8 (1-3 3) aCOSBn (Sequence No. : 8), (5.0 
mg, 1.3 jitmol) and IL-8(34-72) (Sequence No. 9), (4.8 mg, 
1.1 mmol) were reacted in 0.5 ml 6.0 M guanidine-HCl, pH 
7.6, phosphate buffer at 23 °C in the presence of benzyl 

15 mercaptan (5 ml) . After suitable reaction time (48 to 72 

hours) , a ligation yield of approximately 60% was 
obtained. The product was purified by standard reversed- 
phase HPLC as described via supra and characterized by 
electrospray mass spectroscopy. 

20 As shown in scheme 8B, an analytical HPLC spectrum 

(C 18 reversed phase; 25 to 45% acetonitrile at 1% per 
minute; monitored at 214 nm) is shown before the reaction 
of the synthetic peptide segments IL-8 (1-33) aCOSBzl and 
IL-8(34-72) . 

25 As shown in scheme 8C, an analytical HPLC spectrum 

(C 18 reversed phase; 25 to 45% acetonitrile at 1% per 
minute; monitored at 214 nm) of the purified ligation 
product, IL-8 (1-72) (SH) 4 (Sequence No.: 10), in fully 
reduced form. (Inset) Electrospray mass spectrum (raw 

30 data displayed as a single charge state) : observed 
molecular mass 8319.8 daltons; calculated molecular mass 
(average isotope composition), 8319.8 daltons. 

As shown in scheme 8D, air oxidation of the purified 
1-72 ligation product forms the folded [Ala 33 ] IL-8 

35 molecule, shown after HPLC purification. The earlier 
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elution of the folded, disulfide cross-linked native 
protein compared with the reduced polypeptide is typical 
(see Lewis et. al FEB Lett. (1989): vol. 307, p 97; 
Lewis et al *7. Biol Chem. : vol. 269, p 16075; Lewis et. 
5 al Biochemistry (1991): vol. 30, 3128). 

Folding and oxidation conditions: polypeptide at -0.2 
mg/ml, 1M guanidine-HCl, pH 8.5 tris buffer, and vigorous 
stirring in air at ambient temperature (inset) . 
Electrospray mass spectrometry of the oxidized and folded 
10 synthetic IL-8 (raw data displayed as a single charge 
state). Observed molecular mass, 8315.6 daltons; 
calculated molecular mass (average isotope composition) , 
8315.8 daltons. (Scheme 8). 
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15 Example 4: HIV-1 K41 protease (unpublished conditions) 

Ligation reactions are performed in several ways. 
An optimized procedure for a ligation reaction involving 
a (5-thio-2-nitrobenzoic acid) SNB thioester is to weigh 
the two peptides, HIV (1-40)-COSNB (Sequence No.: 11, 

20 formed from standard conditions stated herein) and HIV 

(41-99) (Sequence No.: 12, formed from standard 
conditions stated herein) , as solids in the same reaction 
vessel and add 6.0 M guanidine HC1 pH 6.5 with 100 mM Na 
acetate (the approximate peptide concentration is 7-13 

25 mg/mL of each peptide) . 

After 5 min, approx. 2.0 % thiol is added. Two 
thiol catalysts have been used, viz. benzyl mercaptan 
(forms the benzyl thioester insitu; Sequence No.: 13) and 
thiophenol (forms the phenyl thioester insitu; Sequence 

30 No.: 14) . In the ligation of HIV PR, reaction with benzyl 

mercaptan gave greater than 60% product yield in 4 0 hours 
while the thiophenol gave greater than 80% product yield 
in 10 hours to form HIV-1 K41 protease (Sequence No. : 
15) . 

35 Subsequent treatment with TCEP was found to aid in 



WO 96/34878 PCT/US95/05668 

- 38 - 

the purification and analysis of product by reducing the 
thiophenol and benzyl mercaptan disulfide products which 
tend to co-elute with peptide products. The product was 
purified by standard reversed-phase HPLC as described via 
5 supra and characterized by electrospray mass spectroscopy 
(Scheme 9) . 

Example 5. Barnase example (unpublished conditions) 

The two peptides Barnase (1-48) -SNB (Sequence No.: 

10 16, formed from standard conditions stated herein) and 

Barnase (49-110) , (Sequence No.: 17, formed from standard 
conditions stated herein) were weighed as solids in the 
same reaction vessel and dissolved in pH 7.5 buffer (6M 
Guanidine lOOmM phosphate) . Immediately upon dissolving 

15 the peptides , 2 % benzyl mercaptan ( forms the benzyl 
thioester insitu; Sequence No.: 18) or 4% thiophenol 
(forms the phenyl thioester insitu; Sequence No.: 19) was 
added. After 7 hours, the benzyl mercaptan reaction 
proceeded 25% and the thiophenol reaction proceeded to > 

20 90% to form Barnase (1-110) (Sequence No.: 20). The 

product was purified by standard reversed-phase HPLC as 
described via supra (scheme 10) . 

The addition of thiols increases the reactivity of 
the thioester, particularly if the added thiol is a 

25 better leaving group than the pre-formed thioester. An 

example of this observation is when the benzyl ester is 
converted to a phenyl ester by addition of thiophenol to 
reaction. Reaction yields and rates are substantially 
increased. For example, after 7 hours with benzyl 

30 mercaptan, the Barnase reaction yielded 25%, while the 

thiophenol treatment to the same reaction mixture yielded 
90% to form Barnase (1-110) (Sequence No.: 20). 

Addition of thiols to the ligation mixture also keeps 
the reaction mixture in a reduced form. This prevents 

35 oxidation of the reactive N-terminal Cys residues and 
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when internal Cys residues are present, the thiols reduce 
the formation of intramolecular disulfide bonds. 
Additionally, the reducing environment increases the 
stability of the thioester segment (ligation reactions 
can proceed overnight with little or no hydrolysis at pH 
7.5) . 
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B ARNASE Ki39 
1-48COSR + 49(i(:YS)-110 



7 Hours 4% thiophenol 




7 Hours 4% benzyl mercaptan 
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